Using Google's Keyword Relation in Multidomain Document Classification
نویسندگان
چکیده
منابع مشابه
Using Google’s Keyword Relation in Multi- Domain Document Classification
People can collect all kinds of knowledge from search engines to improve the quality of decision making, and use document classification systems to manage the knowledge repository. Document classification systems always need to construct a keyword vector, which always contains thousands of words, to represent the knowledge domain. Thus, the computation complexity of the classification algorithm...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملKeyword-based document clustering
Document clustering is an aggregation of related documents to a cluster based on the similarity evaluation task between documents and the representatives of clusters. Terms and their discriminating features of terms are the clue to the clustering and the discriminating features are based on the term and document frequencies. Feature selection method on the basis of frequency statistics has a li...
متن کاملText Classification Using Stochastic Keyword Generation
This paper considers improving the performance of text classification, when summaries of the texts, as well as the texts themselves, are available during learning. Summaries can be more accurately classified than texts, so the question is how to effectively use the summaries in learning. This paper proposes a new method for addressing the problem, using a technique referred to as ’stochastic ke...
متن کاملA Document Content Extraction Model Using Keyword Correlation Analysis
Owing to the drastic development of the information and Internet technologies, large amount of information and documents can be easily accessed through the electronic network. In addition to the efficiency of document acquisition, another typical issue for document management is the document content extraction. In order to provide the critical contents of a document to the knowledge requester, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Data Mining & Knowledge Management Process
سال: 2013
ISSN: 2231-007X,2230-9608
DOI: 10.5121/ijdkp.2013.3403